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(57) Abstract 

The present invention is a method for the isolation and characteri- 
zation of C glutamicum genes involved in amino acid biosynthesis, spe- 
cifically, encoding horn, thrB, and thrC^ and sequences regulating their 
expression. Techniques for modifying or replacing these sequences and 
means for facilitating further isolations and characterizations, including 
promotor probe vectors which are useful in screening for high efficiency 
and regulatable promoters and repressors, are also disclosed. A C. giu- 
tamicum genomic library was constructed by cleaving chromosomal 
DNA with restriction enzymes, inserting the DNA fragments into an ap- 
propriate vector, and transforming the resulting recombinant molecules 
(rDNA) into C. glutamicum, Amino acid biosynthetic genes Aom, thrB, 
and thrC, encoding homoserine dehydrogenase, homoserine kinase, and 
threonine synthetase, respectively, were isolated by complementation of 
C. glutamicum auxotrophs. The hom-thrB genes were subcloned on a 3.6 
kb Sal\ generated chromosomal fragment while thrC activity was isolat- 
ed from a second recombinant plasmid within the genomic library and 
subcloned on a 2.7 kb Sph\ generated fragment. The hom-thtB and thiC 
loci, and regulatory sequences, were identified by enzyme assays, com- 
plementation of defined E. coli auxotrophs, SI nuclease and deletion 
mapping. 
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C> GLUTAMICUM THREONINE BIQSYNTHETIC PATFTOAY 

Background of the Inven-fcion 

The present invention is generally in the field 
of genetic engineering, and specifically, in the 
area of manipulation of amino acid biosynthesis in 
Gram positive bacteria o 

Corynebacterium glutamicum is a Gram positive, 
nonpathogenic microorganism that has long occupied a 
central role in the industrial production of amino 
acids by conventional fermentation processes. Past 
strain development has primarily depended on classi- 
cal mutagenesis to remove competing pathways to 
thereby increase substrate availability, and to 
remove or reduce regulatory control of a part-icular 
biosynthetic pathway. Regulatory mutants were 
isolated by selecting strains resistant to toxic 
amino acid analogues. The use of chemical muta- 
genesis has been very successful and a number of 
economically viable L-amino acid fermentation 
strains, such as strain producing L-glutamate and 
L-lysine, have been estcQDlished . 

The recent development of cloning vectoirs, 
including those described in U.S. Patent No. 
4,649,119 to Sinskey et al . , and methods for DNA 
transformation of glutamicum ^ as decribed by 
Katsumata et al . , J. Bacteriol . 159,306-311 (1984), 
and Yoshihama et al., J, Bacteriol. 162, 591—597 
(1985) , and the closely related Corynebacterium 
(Brevibacterium) lactof ermentum described by 
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Santamaria et al. in J. Gen. Microbiol. 13 o, 
2237-224S (1984), initiated a new era in the genetic 
manipulation of these organisms - 

However, the commercial utilization of c. 
«g-lutamicma recombinant DNA technologies for future 
strain development is dependent on the development 
of additional genetic tools and a better understand- 
ing of the fundamental molecular biology of this 
species. The use of recombinant DNA techniques to 
develop industrial strains would offer several 
advantages over classical mutagenic strategies.. . For 
example, specific alterations such as the replace- 
ment of a low efficiency promoter would be possible, 
the stepwise isolation of enhancing mutations could 
be avoided, regulatory systems could be engineered 
to allow the temporal control of gene expression 
during a fermentation process, and novel genes 
and/or pathways could be introduced into an 
organism.. 

It is therefore an object of the present 
invention to isolate and characterize genes encoding 
components of amino acid biosynthetic pathways in 
CorynebacteriuTn _ 

It is another object of the present invention 
to clone the isolated amino acid biosynthetic genes, 
specifically those involved in the threonine bio- 
synthetic pathway. 

It is still another object of the present 
invention to elucidate the structure of these genes 
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and the regulatory mechanisms that modulate their 
expression. 

It is a further object of the present invention 
to characterize and modify the expression of the 
cloned, amino acid biosynthetic genes, as well as 
the primary structure and regulatory features of 
their protein products. 

Summary of the Invention 

The present invention is a method for the 
isolation and characterization of C. glutamiciim 
genes involved in amino acid biosynthesis, 
specifically, horn , thrB, and thrC, and sequences 
regulating their expression. Techniques for modify- 
ing their expression and regulation are also des- 
cribed. Methods and sequences facilitating further 
isolations and characterizations are also disclosed, 
including promoter probe .'vectors which are useful in 
screening for high efficiency and regulated 
promoters • 

A C. glutamicum genomic library was constructed 
by cleaving chromosomal DNA with the restriction 
enzyme Mbol, inserting the resultant DNA fragments 
into a C, qlutamicum/Bacillus subtil is shuttle 
vector, pHY416, and transforming the resulting 
recombinant molecules into C, glutamicum . Amino 
acid biosynthetic genes horn, thr B, and thr C, encod- 
ing homoserine dehydrogenase, homoserine kinase, and 
threonine synthase, respectively, were isolated by 
complementation of C. glutamicum auxotrophs. The 
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hom-thrB genes were subcloned on a 3.6 kb Sail 
generated chromosomal fragment while thr c I^ivity 
was isolated from a second recombinant plasmid 
within the genomic library and subcloned on a 2.7 kb 
SEhl generated fragment. The hom-thrB and thrC loci 
were identified by a combination of enzyme assays 
and complementation of defined E. coli auxotrophs, 
and amino acid sequence homology. 

Enzymatic assay of homoserine dehydrogenase 
activity, encoded by horn, in strains harboring the 
cloned gene demonstrated a 20- fold increase in 
specific activity compared to wild type controls - 
Both the chromosomal and plasmid encoded activities 
are strongly inhibited by L-threonine and repressed 
by L-methionine. The L-methionine repression of the 
plasmid encoded activity demonstrates that the 
structural gene and sequences responsible for its 
expression are included within the cloned fragment. 
Southern hybridization analysis demonstrated that 
the hom/thrB and thrC. loci are separated by a 
minimum of a . 3 kb in the c. glutamicum chromosome. 
This is a different genomic organization from that 
observed in E, coli where the three genes represent 
a single operon. Three lines of evidence demon- 
strate that the C. Qlutamicmn hom-thrB genes repre- 
sent an operon. First, they are located together 
(separated by ii base pairs) and coordinately 
regulated by L-methionine. Secondly, Northern 
hybridization analysis has identified a single 2.4 
kb, L-methionine repressed RNA transcript. 
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consistent with the size of the two coding regions. 
Finally, deletion of the promoter upstream of the 
^QJn gene significantly reduces the expression of 
both the horn and thr B genes. 

The hom-thrB and thr C promoters were identified 
by complementation of auxotrophs^ deletion analysis 
and SI nuclease mapping. The hom - thr B operator, a 
hyphenated dyad symmetry element, was also identi- 
fied by deletion analysis. Methods for modifica- 
tion, removal or replacement of these regulatory 
elements are described. 

Brief Description of the Drawings 

Figure 1 is a schematic of the threonine 

biosynthetic pathway. 

Figure "2 is a graphic depiction of subcloning 

strategy and restriction maps of recombinant plas- 

mids pFS78, pFSSO, pFS3.6A, pFS3,6B, pSPCl and 

pSPC4 . 

Figure 3 is the nucleotide seqpaence and pre- 
dicted protein sequences of horn and thr B. 

Figure 4 is the nucleotide sequence and pre- 
dicted protein sequence of the thrC gene. 

Figure 5a is the sequence of the C. glutamicum 
hom - thr B regulatory region indicating the mRNA 
initiation site, -35 and -10 regions of thrPl and 
the hyphenated dyad symmetry element responsible for 
methionine mediated repression (thrO) . 

Figure 5b is the potential stem/loop structure 
formed by the hyphenated dyad symmetry element. 
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Figure 6 is a schematic of the construction of 
C. glutamicvm hom-thrB promoter deletions and subse- 
quent analysis. 

Figure 7: Deletional analysis of the hom-thrB 
promoter region. The 3.6 kb C. glutamicum chromo- 
somal DNA insert of pFS3.6 carrying the hom-thrB 
genes is indicated as a hatched box and the nucleo- 
tide sequence of the relevant promoter-containing 
Dral-Hindlli fragment is shown. The extent of Bal31 
generated deletions in various plasmid constructs 
based on vector pWSTl are presented as black bars. 
The start of transcription as determined by si 
nuclease mapping is indicated by an arrow. 

Detailed Desc ription of the Invenfion 

Recombinant DNA technology has been used to 
isolate, characterize and manipulate genes involved 
in the amino acid biosynthetic pathway of Coryne- 
bacterium glutamicum. The technology and results 
obtained aid in the elucidation of the fundamental 
molecular biology of glutamicum and construction 
of amino acid producing strains, particularly 
threonine. 

Threonine is produced in a series of reactions 
beginning with the reduction of the beta-carboxyl 
group of aspartic acid to from the aldehyde, 
aspartic beta-semialdehyde, which takes place via an 
acyl phosphate intermediate, beta-aspartyl phos- 
phate, in an ATP requiring reaction. Aspartic 
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beta-semialdehyde is converted by homoserine de- 
hydrogenase, encoded by hom, to homoserine- Homo- 
serine is phosphorylated by homoserine kinase, 
encoded by thrB, to homoserine phosphate in an ATP 
requiring reaction. The product, homoserine phos- 
phoric acid, is in turn converted to threonine by 
threonine synthase, encoded by thr C, a pyridoxal 
phosphate enzyme. Threonine, the end product of the 
sequence, is an inhibitory modulator of aspartate 
kinase- This reaction pathway is demonstrated in 
Figure 1. 

Genes encoding the three enzymes, homoserine 
dehydrogenase (horn) , homoserine kinase (thrB) , and 
threonine synthetase (thrC) have been isolated, 
identified, cloned, and their expression modified as 
follows. The cloning and determination of the 
nucleotide sequence of these genes provides a means 
for manipulating the expression and catalytic 
properties of the encoded enzymes. Means for 
altering the expression and the end product include 
in vitro mutagenesis of the C, qlutamicum horn gene 
and selection of derivatives resistant to L- 
threonine mediated feedback inhibition, sequence 
determination of feedback resistant derivatives and 
the use of rDNA technic[ues to combine separate 
genetic alterations, determination and modification 
of the promoter structure and protein start sites 
for hom , thr B, and thrC, increased expression of 
hom - thr B via increased promoter efficiency and 
removal of L-methionine transcriptional repression. 
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and molecular joining- of the C. glutamicum hom-thrB 
and thrC genes to form a hom - tlir BC operon. 
Isolation of tfa r- and thr-/met- auxotrophs of c. 
glutamiciim . 

A genetic background in which to isolate the 
threonine biosynthetic genes was constructed by 
mutating C. cflutamicum and isolating auxotrophs 
defective in threonine biosynthesis. The C. 
glutamicum is maintained on LB media (lo g~NaCl, lo 
g Bactotryptone, 5 g Yeast extract, i 1. h^O) or 
minimal medium for c. Qlutamicum (mcg) (lo g 
glucose, 7 g {im^)^SO^, 3g K^HPO^, Ig KH^PO^, 0.4 g 
MgSO^.7 H^O, 2 mg FeSO^'^H^O, 2 mg MnS0^.H2O, 1 mg 
Biotin, 10 mg Thiamine, 2 ml trace elements, 1 
l.H^O) . 1.4% agar was added for: plates. The trace, 
elements contained 44 mg Na2B^O^ , .7H2O, 20 mg 
f^4^6^°7°27-^^2°' ^ ZnSO^,- 13 5 mg CuSO^.SH^O, 

3.6 mg MnCL2.H20, 435 mg FeCl3 in 500 ml H^O. Where 
appropriate, 50 g/ml L-threonine, 50 g/ml L- 
methionine, 50 g/ml ampicillin, 15 g/ml kanamycin or 
10 g/ml rifampicin were added. 

C. glutamicum AS019, a rifampicin resistant 
variant of ATCC 13 059, was grown at 30 -C in LB to 
exponential phase (2 x 10« cfu/ml) , harvested by 
centrifugation and resuspended in an equal volume of 
minimal media for C- glutamicum (MCG) . cells were 
mutagenized by the addition of nitrosoguanidine 
(NTG) (40 micrograms/ml) to 1 ml of cells and 
incubation without shaking at 3 0''C for 30 minutes. 
Mutagenized cells were harvested by centrifugation. 



/ 
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resuspended in 1 ml LB media and diluted 1:100 into 
10 ml aliquots of fresh LB. Following growth at 
3 0"C with shaking to stationary phase, the cells 
were diluted and plated on LB agar, Auxotrophs were 
screened by replica plating onto MCG plates and 
identification by growth patterns on amino acid 
pools. Only one strain displaying a particular 
auxotrophy such as threonine requirement was saved 
from each of the 10 ml aliquots. 

Twenty four thr- and six thr~/inet- C. gluta- 
micum auxotrophs were isolated. The thr/met- 
auxotrophs grow on MCG plates supplemented with 
homoserine. The thr- auxotrophs may have mutations 
in either of the threonine specific enzymes, homo- 
serine kinase or threonine synthase. 
Transformation and complementation of the C. 
glutamicum auxotrophs . 

Two threonine requiring auxotrophs of C. 
glutamicum , AS155 and AS178, were transformed using 
the following method. An overnight culture of AS019 
was inocculated at a ratio of 1:100 into LB broth 
containing 0.2% glucose and 2 . 0% .glycine . The cells 
were incubated at 30 •'C for 15 hours with aeration. 
10 ml of cells were harvested by centrif ugation and 
washed in SMMC buffer (0.5/M Sorbitol, 2 0 mM MgSO^ , 
20 mM CaCl^, 50 mM Na Maleate, pH 7.0). Cells were 
resuspended in 2 ml SMMC buffer containing 2.5 mg 
lysozyme/ml. The cell suspension was incubated at 
3 7"*C with shaking for 90 minutes. Cells v/ere again 
harvested by centrif ugation at 6000 rpm for ten 
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minutes and resuspended in three ml SMMC buffer. 
0.3 ml aliguots of "protoplasted" cells were placed 
in polypropylene tiibes. Plamid DNA in 0.5 M 
sorbitol was added. 0.7 ml of 40% PEG, molecular 
weight 3 350, 50 mM Tris, 20 mM CaCl^ pH 7.4 was 
added and gently mixed. 2.0 ml of SB broth (0.5 M 
sorbitol, 1 X LB, 20 mM CaCl^, 2 0 mM MgSO^) was 
added to the transformation mixture, which was then 
incubated at 30 'C without shaking for three hours. 

C- glutamicum protoplasts obtained by growth in 
glycine and lysozyme treatment can also be suspended 
in SMMC and frozen at -80 -c for use in subsequent 
trans f oirmations . 

The transformants were plated out on selected 
plates. The two threonine requiring axixotrophs 
AS155 and ASITS were transformed with a C. 
glutamicxm genomic library containing approximately 
2.5 genomic eijuivalents constructed in the C. 
qlutamicum/B. subtilis chimeric plasmid pHY4l6, 
described by Yoshihama et al. , J. Bacteriol. 162, 
591-597 (1985) and Follettie and Sinskey in j. Bac- 
terid . 166 695-702 (1986). Kanamycin resistant 
transformants were selected and screened for comple- 
mentation of the threonine auxotrophy by replica 
plating onto MCG/Km plates. Three AS155 trans- 
formants and a single AS178 transformant were 
capable of growth without threonine supplementation. 
Plasmids were isolated and characterized by restric- 
tion analysis. 
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All four transf ormants harbored the same 
recombinant plasmid, designated pFS78, described in 
Figure 2, which contain a 6,8 kb chromosomal DNA 
insert. The recombinant plasmid was transformed 
into 10 of the independently isolated C. qlutamicum 
threonine auxotrophs and three auxotrophs requiring 
threonine and methionine or homoserine supplemental 
tion. The Km transf ormants were screened for 
complementation on MCG/Km plates. The results 
demonstrated that pFS78 complements all three 
homoserine auxotrophs and four of the ten thr- 
auxotrophs, indicating that the plasmid carries the 
homoserine dehydrogenase gene, horn, as v/ell as one 
of the threonine specific genes, thr B or thr C. 

Two of the thr- auxotrophs not complemented by 
pFS78, AS148 and AS213, were transformed with the 
genomic library and Km"^ colonies screened for growth 
on MCG/Km plates. Both thr+ AS148 and thr+ AS213 
transf ormants were obtained, and their plasmids 
isolated and characterized by restriction analysis. 
All thr+ transf ormants harbor the same 12.5 3cb 
recombinant plasmid designated pFSSO, also shown in 
Figure 2 , containing a 3 . 1 kb chromosomal DNA 
insert. The chromosomal sequence cloned in pFSSO 
complements four other thr- auxotrophs not comple- 
mented by pFS78. However. pFSSO was unable to 
complement the thr- or thr-/met- strains comple- 
mented by pFS78. 

Subcloninq and identification by enzyme assay, 
complementation of auxotrophs, and amino acid 
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secfuence homoloaY of the c. g lutamicum horn. thirB. 
and thrc locus . 

Deletion analysis of pFS78 indicated that both 
the thr- and thr-Zmet- complementing activities are 
located on a 3.6 kb Sail generated chromosomal 
fragment. This fragment was purified by agarose gel 
electrophoresis and electroelution and ligated into 
the unique Sai l restriction site of the C. 
glutamicum /E. coli chimeric vector pWS124, described 
by Batt, Shamnabruch and Sinskey, Biotech. Letts. 
7:717 (1985). The recombinant vector, pFS3.6, 
complements both AS178 (thr-) and AS253 (thr-/met-) . 
The plasmid also complements E. coli thr B auxotroph, 
E. coli 5076. 

The 2.7 Kb S£hl generated chromosomal fragment 
of PFS80 was purified by agarose gel electrophoresis 
and ligated into the unique Sph l restriction site of 
PUC18. The resulting recombinant plasmids, desig- 
nated psPCl and PSPC4, also diagrammed in Figure 2, 
were able to complement E. coli 5077 (thrC) but not 
E- coli 5076 (thrB) . 

Southern hybridization analysis was used to 
determine the relationship of the hom-thrB and thr C 
loci. The results demonstrate that the hom-thrB and 
thrc locus in this species are physically separated 
by a minimum of 8.8 kb. 

The homoserine dehydrogenase activity. in crude 
extracts of wild type AS019 was compared to that of 
the homoserine auxotroph AS 25 3 with and without the 
complementing plasmid pFS3.6 in order to determine 
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the regulation and extent of overproduction of the 
cloned C, qlutamicum hom gene product. Methods used 
for preparing a C, glutamicum crude extract 
preparation and assays for homoserine dehydrogenase, 
homoserine kinase and aspartokinase are as follows: 

Homoserine dehydrogenase is measured by the de- 
crease in absorbance at 340 nm due to the oxidation 
of NADPH (extinction coefficient = 622 0) The 
reaction mixture contains: 3 mM DL aspartate-beta- 
semialdehyde (ASA), 0.4 mM NADPH, 0.1 M PO^, pH 7.0, 
0.5 M KCl, and enzyme preparation, in a total volume 
of 0.7 ml. A blank reaction mixture without DL-ASA 
serves as a control. DL-ASA is synthesized by the 
ozonolysis of DL-allyl glycine according to the 
procedure of Black and Wright, J. Biol. Chem. 213, 
39 (1955). 

Homoserine kinase activity was determined by a 
coupled enzyme assay which measured the reaction 
product ADP. The reaction mixture contained 3.3 mM 
ATP, 0.45 MM NADH, 4 . 5 mM phophenol pyruvate, 1.0 mM 
L-homoserine, 10 mM MgCl^ 12.5 units pyruvate kinase 
(Sigma, St. Louis, MO) , 25 units lactate de- 
hydrogenase (Sigma), 0.25 M KCl, 100 mM HEPES buffer 
(pH 7.8) and enzyme preparation in a total volume of 
1.0 ml. The reaction was monitored by the decrease 
in absorbance at 34 0 nm due to the oxidation of 
NADH. The absorbance decrease in the absence of 
added substrate, L-homoserine, was determined and 
subtracted from values obtained with the complete 
assay mixture. 
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Aspartate kinase activity, inhibited by 
threonine, is detennined by measuring the 
aspartohydroxamate produced according to the pro- 
cedure of Black and Wright, J. Biol. Chem. 213, 27 
(1955) . Protein in the crude extract is precipi- 
tated by adding 5 volumes of sat\irated ammonium 
sulfate and resuspended in 0,3 volume of buffer 
containing o.i m Tris, pH 7.4, 0.2 M KCl. The assay 
mixture contains: o.l m Tris, pH 7.4, 10 mM ATP, lo 
mM Mgso^ 0,6 M hydroxylamine (pH 7.4), 0.6 M (NH^) 
50 mM L-aspartate and enzyme preparation in a total 
volvune of 1 ml. After 1 hr incubation at 37 "C, the 
reaction was stopped by the addition of 1.5 ml of 
solution containing lo% FeCl3.6 K^O , 3.3% trichloro- 
acetic acid and 0.7 N HCl. After centrifugation, 
aspartohydroxamate concentration is measured by 
absorption at 540 nm (extinction coefficient = 600). 
A blank reaction mixture without L-aspartic acid 
seirves as a control. 

Protein concentration of the crude extracts is 
determined using the Bio-Rad protein assay with 
bovine serum albvmin standards (BioRad Laboratories, 
Richmond, CA) . 

The C. glutamicum thr'/met" strain AS253 
harboring the parental vector pWS124 had less than 
2.5% of the homoserine dehydrogenasfs activity 
present in the wild type AS019. Introduction of the 
cloned C. glutamicum hom gene present on pFS3.6A 

C. glutcunicum AS253 leads to a twenty-fold 
increase in the specific activity of homoserine 
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dehydrogenase over that observed in wild type C. 
glutamlcum AS019. The orientation of the cloned horn 
gene with respect to the vector affected its expres- 
sion. Crude extracts of AS253 harboring pFS3.6B 
demonstrated an 11-fold increase in homoserine 
dehydrogenase activity relative to wild type. 

The level of aspartokinase in glutamicum 
AS019 harboring either the parental vector pWS124 or 
the recombinant vector pFS3,6 was unchanged over 
that observed in the controls. Further, the 
aspartokinase specific activity was not repressed by 
growth in MCG supplemented with 2,7 mM L-methionine . 
The differential transcriptional control of homo- 
serine dehydrogenase, in combination with the lack 
of increased aspartokinase activity in cells harbor- 
ing pFS3.6 (hom-thrB) , demonstrates that the two 
activities are not catalyzed by a bifunctional 
protein as in coli . The expression of the 
encoded homoserine dehydrogenase is repressed 3,2 
fold by the addition of 2,7 mM L-methionine, 
Expression of the glutamicum thrA gene is also 
repressed by L-methionine, demonstrating that the 
expression of the pFS3.6 encoded horn gene is medi- 
ated by its native promoter/ operator . Expression of 
the cloned C, qlutamicxim thr B gene was similarly 
repressed 2.6-fold by 2.7 mM L-methionine, 

The activity of the homoserine dehydrogenase, 
both chromosomal and plasmid encoded, is inhibited 
by the addition of L-threonine to the assay mixture. 
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Addition of 1 D-threonine or L-metiiionine does 
not affect the homoserine dehydrogenase activity. 

The complete nucleotide sequence of 3704 bp is 
shown in Figure 3 for hom and thrB . Two long open 
reading frames (ORF's) extend from position 907 to 
2329 and from 2312 to 3269. The protein sequences 
of homoserine dehydrogenase and homoserine kinase 
are predicted on the basis of the sequence extending 
from the first potential translation initiation 
codon, either ATG or GTG (position 994) to the TAA 
stop at position 23 3 0 for ORFl and from the ATG at 
2342 to the TAG stop 3269. The predicted proteins 
have molecular weights of 46,43 6 and 32,618 daltons 
for ORFl and ORF2, respectively. A translation 
terminator is present at position 3279 to 3 3ii, 
seven nucleotides downstream of the TAG stop codon. 
This is shown in further detail in Figure 5. The 
sequence forms a strong step-loop structure having a 
stem length of 15 bp and a seven base loop similar 
to the rho-independent terminators from coli. 
The 5' sequence to ORFl has a region strongly rich 
in A:T containing the hom-thrB promoter and site of 
action of the methionine mediated repression. 

The DNA sequence of the chromosomal DNA insert 
in pFSao, encoding threonine synthase (thrC) , was 
also determined by dideoxy sequencing techniques and 
is shown in Figure 4. a restriction map was pre- 
dicted and checked against restriction analysis 
results to corroborate the accuracy of the sequence 
data. Computer aided analysis (UWGCG Programs, uw 
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Biotechnology Center, University of Wisconsin) was 
used to predict the thr C gene within the sequence 
data. These results were compared with in vivo 
genetic deletion analysis. An open reading frame 
extends 5' to GTG at 39 6) marking the amino terminal 
region of thrC. 

The threonine synthase activity maps within the 
1,57 kb Bcll-Stul restriction fragment. The 
computer predicted structural gene sequence, GTG 
(396) to TAA (1881), lies completely within this 
fragment. The Stu I restriction is 176 bp 3 » of the 
preducted translation stop codon. 

Heterospecif ic genetic complementation of the 
£* coli thrC 1001 auxotroph shows that the C. 
qlutamicum thrC gene is expressed in coll. By 
comparison, using computer searches for regions 
similar to coli rihosome binding sites and 
translation terminator sequences, a ribosome binding 
site adjacent to GTG (396) and a significant termina- 
tor-like sequence 35 bp 3 » of the TAA at 1881 were 
identified. Homology was detected between C. 
qlutamicxun and coli thr C regions at both DNA and 
predicted protein sequence levels. Limited 
conservation of DNA sequence was observed between 
the coli thrC gene and the region 400 to 1400 bp 
of the Cj^ qlutamicum thr C sequence. There is 
consistent conservation in the central region 
(residues 100 to 350 of qlutamicum thr C) and the 
carboxy terminal residues 4 30 to 48 0. 
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Identification of the hom-HlirB transcription start: 
site by SI nuc lease mapping and deletion analysis . 

The transcriptional start site for tlie c. 
qlutami cxm hom-thrB genes was identified using SI 
nuclease mapping, as described by Berk and Sharp, 
Cell 12, 721 (1977), The procedure requires the 
isolation and denaturation of a DNA fragment which 
overlaps the promoter and has been label at the 

5 'end of the antisense strand. Hybridization of 
this fragment to its cognate mRNA and subsequent 
digestion with the single strand specific exo- 
nuclease SI results in the degradation of the 3 'end 
of the labled DNA fragment up to the point at which 
it is protected by the HNA. The size of the result- 
ing DNA fragment is determined by comigration with 
DNA fragments resulting from the sequencing reac- 
tions, of Maxam and Gilbert, Methods in Enzymol. 65, 
499-559 (1982). This enables the identification of 
the transcriptional start site. The results can 
then be confirmed by deletion analysis of the 
promoter using restriction enzymes and exonuclease 
Bal 31 to construct series of deletions which are 
then reinserted into the organism and assayed for 
activity. 

The Smal-Hindlll restriction fragment that 
encompasses the hom-thr B promoter/ operator and the 
first seven amino acid residues of the horn gene 
product was used in the SI nuclease mapping studies. 
Plasmid pRAl (pUClS containing the 3 . 6 kb Sai l c. * 
glutamicum genomic fragment encoding hom-thrB) was 
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cut with Hin dlll to generate a 1.014 kb restriction 

fragment, dephosphorylated with CIP (calf intestine 

phosphatase , Boehringer-Mannhein Biochemicals , 

Indianapolis, IN), labelled by treatment with 

3 2 

polynucleotide kinase and gamma P-ATP (specific 
activity greater than 5000 Ci/mmol, Amersham Corp. 
Arlington Heights, IL) , subsequently cleaved with 
Smal (New England Biolabs, Beverly, MA) to produce a 
2 42 bp DNA fragment that was then purified by 
preparative polyacrylamide gel electropheresis • All 
manipulations were carried out in accordance with 
procedures described in Molecular Cloning by T, 
Maniatis et al. (Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY, 1982) and the enzyme suppliers 
recommendations . 

Understanding the expression of a gene requires 
the isolation and structural characterization of the 
mRNA product, the size and ntimber of transcripts, 
the regulatory control and the site of transcription 
initiation. Criteria evaluated for RNA isolated 
from Corynebacteria include RNA quality (Abs^gQ^ 
Abs--^, ratio of about 1.95 to 2.05), purity 
(degradation and contamination determined by agarose 
gel electropheresis) , and yield in mg RNA/liter 
cells (Abs^gQ = 4 micrograms RNA/ml) . 

RNA was extracted from C. glutamicum AS 019 
using the guanidinium isothiocyanate/ French press 
isolation method. In this method, all. LB culture 
of AS 019 is grown at 30 'C to late exponential phase 
and harvested by 10 minutes centrif ugation at 5000 



SOOCID: <WO_8809819A^.I_> 



wo 88/09819 



PCT/US88/62029 



20 

RPM in a Sorvall GSA-250 rotor. The cells are 
washed at: 4-C in 0.1 M NaCl, 10 mM Tris.Cl, pH 8.0, 
1 mM EDTA, and harvested again- The pellets are 
combined in 50 ml 4 M guanidinium isothiocyanate, 
2-mercaptoethanol (GuT:2ME) and immediately lysed by 
compression through a French press at approximately 
1500 psi. Cell debris is sedimented by centrifuga- 
tion in a Sorvall SS-34 rotor for 10 minutes at 
10^000 rpm. six ml aliquots of the supernatant are 
applied to 4 ml of 5.7 M CsCl, 10 mM ETTA, 25 mM 
sodium acetate and centrifuged at 3 4,000 rpm in a 
BecJcman Ti50 fixed angle rotor for 2 4 hours. 

The density gradient separates the sheared DNA 
molecules from the RNA, which forms a pellet at the 
tube base. This RNA pellet is resuspended in 5 ml 
10 mM Tris, pH 7.5, 1 mM EDTA 5.0% Sarkosyl (TESK) 
containing 5.0% phenols The solution is made 0.1 M 
with 5 M NaCl, and extracted with 10 ml 50% phenol, 
4 9% chloroform, 1% isoamylalcohol (PCIA) . The 
phases are separated by centrifugation in a Sorvall 
SS-34 at 3,000 rpm for 5 minutes and the phenolic 
phase back extracted with TESK containing 0.1 M 
NaCl. The combined aqueous phases are made 0.2 M 
with sodium acetate, pH 5.5, and the RNA precipi- 
tated overnight at -20 in 2.5 volumes of ethanol. 
After centrifugation at 10,000 rpm for 20 minutes at 

the RNA pellet is washed in ethanol, dried 
under vacuum, and resuspended in RNase free water at 
a concentration of 0.5 mg/ml . 
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Total cellular RNA, isolated from AS 019 grown 
in minimal media with and without L-methionine (40 0 
microgram/ml) supplementation, was separated by 
agarose gel electrophoresis, transferred to nitro- 
cellulose paper and probed either with pMF-L2 or 
pUC-B5- Plasmid pMF-L2 contains a 1.8 kb Nael 
fragment which spans both the hom and thr B genes but 
contains no flanking sequences. RNA is glyoxylated 
to prevent spurious electrophoretic patterns caused 
by potential secondary structure. For each lane, 2 0 
micrograms of c. qluramicum RNA is suspended in 8 
microliters of glyoxal reaction mixture (1 M 
glyoxal, 50% DMSO, 10 mM potassium phosphate, pH 
7.0) and incubated 1 hour at 50 'C. Glyoxylated RNA 
samples are prepared for loading by the addition of 
17 microliters formamide, 6.2 microliters formalde- 
hyde, 3 microliters lOx running buffer 0.2 M 
morpholinopropanesulf onic acid (MPOS) , 50 mM sodium 
acetate, 10 mM EDTA) and 5 microliters loading dye 
(50% glycerol, 1 mM EDTA, 0.4% bromophenol blue, 
0.4% xylene cyanol) . Samples are loaded onto an 
agarose/ formaldehyde gel (2.2% agarose, 1 x running 
buffer, 18% formaldehyde, pH adjusted to 7 . 0 with 
NaOH) and electrophoresed at 3 0 mA. Hindlll re- 
stricted lambda DNA is labeled with *^^P, denatured 
and glyoxylated similar to RNA samples and utilized 
as a molecular size standard. Following electro- 
phoresis, nucleic acid is transferred to a nitro- 
cellulose filter using the technique of Southern, J . 
Mol. Biol. 98, 503-517 (1975) except that no prior 
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treatment of the gel was necessary. Following 
transfer for 15 hours, the filters are baked in 
vacuo at 80 'C for 2 hours. 

The filters are prehybridized in sealed plastic 
bags for 16 hours at 42' C in a minimum volxime, 
approximately 10 mis of hybridization buffer (50% 
deionized formamide, 5 x SSC, 50 mM sodium acetate, 
pH 6.5, 25 micrograms sonicated denatured salmon 
sperm DNA, 0-02% bovine serum albxxmin, 0.02% Ficoll, 
0.02% polyvinyl pyrrolidone) . (SCC = 0.15 m NaCl. 
0.015 M sodium citrate, pH 7.0) . The DNA probe is 
labeled with ^^^^^ translation, according to 

Rigby et al., J. Mol. Biol. 113, 237-251 (1977), 
heat denatured, added to the hybridization buffer, 
and incubated with the filter for 20 hour at 42 °C. 
Filters are siibsequently washed five times in 2 x • 
SSC/0.1% SDS at room temperature and then three 
times in 0.2 SSC at 50-c. After drying, the filters 
are exposed to X-ray film and specific bands of 
hybridization determined by autoradiography. 

Hybridization of pMF-L2 to total C. glutamicum 
RNA leads to the appearance of a single 2.4 kb 
transcript. This observation is in agreement with 
the predicted size of the hom -thrB transcript (2408 
base pairs) , based on SI nuclease mapping, and the 
computer predicted termination site of the thrB 
gene. The size of the observed transcript and the 
lack of a detectable second transcript hybridizing 
to the hom-thrB probe leads the conclusion that 
glutamicum expresses hom and thrB from a single 
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transcriptional unit, representing the first defined 
operon in this organism. Results obtained by SI 
nuclease mapping of the thrA-thrB junction support 
this conclusion. 

Hybridization of hom-thrB specific RNA to the 
2 42 bp Sma l- Hind DIII end-labeled probe was achieved 
by lyophilizing 3 0 micrograms C. glutamicum RNA with 
10 ng probe DNTA and resuspending in 10 microliters 
hybridization buffer (40 mM PIPES pH 6.4, 0.4 M 
NaCl, 1 mM EDTA, 80% deionized formamide) . The DNA 
was denatured by heating at 90 *C for 10 minutes. 
Hybridization was performed overnight at 49 "C. 

The hybrid DNA-RNA molecules were digested with 
2,000 units SI nuclease (Bethesda Research Labora- 
tories, Inc., Gaithersburg,. MD) in 235 microliters 
assay buffer (250 mM NaCl, 3 0 mM sodium acetate, 10 
mM zinc sulphate, 2 00 microgram/ml calf thymus DNA) . 
The digest was incubated at 37 "C for one hour and 
terminated by extraction with 25 0 microliters PCIA 
(phenal/chloroform/isoamyl alcohol, 50:48:2). 
Nucleic acids were precipitated from the aqueous 
phase with 0.2 M sodixim acetate, 2 micrograms yeast 
tRNA and 50 microliters ethanol at -20 'C. Following 
centrifugation and drying, each sample was dissolveli 
in 3 microliters formamide loading buffer (100 ml 
formamide, 0.72 g Na^ EDTA, 0.03 g bromophenol blue) 
and applied to a 6% polyacrylamide/7 M urea sequenc- 
ing gel. The 2 42 bp Smal-Hindlll restriction 
fragment was sequenced using the procedures for the 
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G, C+T and c reactions as described in Maxam and 
Gilbert, Methods in Enzvmol. 65, 499-559 (1982). 

The Smal-Hindlll fragment labeled at the 
Hindlli 5 'terminus (antisense strand) acts as a 
specific probe to RNA complementary to this region. 
TO detect the start of the hom-thrB mRNA transcript 
total RNA is hybridized to the labeled probe and the 
unprotected single stranded nucleic acid digested 
with the single strand specific SI nuclease. The 
length of the protected region of the DNA probe is 
resolved on a denaturing 6% polyacrylamide gel. The 
hom-thrB transcript initiates at coordinate 906, at 
the first of a GG doublet. This nucleotide is 88 bp 
5- of the first available ATG codon in the horn open 
reading frame. This defines the promoter .region 
responsible for hom-thrB expression and is 
designated thrPl for threonine promoter 1. The 
sequence is shown in Fxgxxce. Sa. 

NO detectable degradation of the DNA probe from 
the 0.46 Jcb Fokl- PVUJ.1 restriction fragments span- 
ning the horn thTB Pvull junction, indicating that 
the majority of the thrB expression was mediated by 
thrPl . 

Identification and deletion of i:he oner^iin-r 
mediatinq L-methionin e repression of hom-thi-R 
expression. 

In addition to promoter identification, 
restriction and/or exonuclease Bal3l deletions have 
been utilized in identification and deletion of the 
operator (thrO) , which mediates the L-methionine 
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repression of hom-thrB expression and in the con- 
struction of a feedback inhibition deficient variant 
of the horn gene. These studies were facilitated by 
construction of a special vector designated pWSTl. 
When investigating promoter structure and function 
on a plasraid, it is desirable to eliminate read 
through transcription from upstream promoters 
located within the cloning vector* Plasmid pWSTl 
contains the E. coli trp A terminator followed by a 
polylinJcer to facilitate the cloning of the gene in 
the various deletion generated variants. The effect 
of the deletions can be assayed in the absence of 
influence by upstream promoters. This vector is 
applicable not only to. the analysis of the hom-thrB 
genes, but also the characterizations of other 
promoter/ operator systems in C, glutamicum . pWSTl 
is constructed using the trpA terminator obtained 
from Pharmacia Fine Chemicals, Piscataway, NJ. Sac l 
linkers are attached and the trp A terminator in- 
serted into the polylinker region of M13mpl9, The 
constructs are sequenced using the method of Sanger 
et al. Proc. Natl, Acad. Sci. USA 74,5463-5467 
(1977) , to screen for insertion of the terminator in 
the proper orientation. The terminator/poly linker 
is subsequently ligated into the Smal-Sall re- 
stricted pTF3 3, a derivative of the C, qlutamicum/E , 
coli shuttle vector pWS124, described by Batt et 
al, , Biotechnol , Letts , 7:717 (1985), DNA linkers 
and enzymes are obtained from New England Biolabs or 
Boehringer-Mannheim, as noted earlier. 



SOOCID: <WO Ba09B1GA2 I > 



wo 88/09819 



PCr/US88/e2029 



26 

The hom-thrB genes ligated into Smal/Sall re- 
stricted pWSTl on a 2911 bp Smal-Sall restriction 
fragment and a 2,815 bp Dral-Sall restriction frag- 
ment, designated pWFS2*9 and pWFS2.8, respectively. 
Further deletion of the hom-thrB upstream region is 
accomplished as diagramed in Figure 6- The recombi- 
nant vector PWFS2.9 was linearized by Smal digestion 
and deletions constructed by digestion of 6 micro- 
grams of DNA with, the exonuclease Bal31 (0.2 units)/ 
micrograms DNA. Aliquots of the reaction mixture 
were removed at 3 0 second intervals between 4 and 15 
minutes, and the reaction stopped by dilution into 
one volume of 50 mM EDTA. The DNA was digested with 
Sail and the resulting hom-thrB containing fragments 
purified by agarose gel electrophoresis* These 
fragments were ligated into Sma l -Sal l digested pWSTl 
and the resulting recombinant mixture used to 
transform C, glutamicum AS253 (horn) . 

The extent of the Bal31 generated deletions, 
diagrammed in Fig. 7,. in complementing a non- 
complementing derivative plasmid is deteirmined by 
nucleotide sequence analysis and measurement of the 
levels of homoserine dehydrogenase activity in crude 
extracts. The ability of the deletion plasmids to 
complement the hom-thr B auxotrophy of strain AS253 
was checked by streaking the corresponding AS253 
transformants onto MCG/kanamycin agar plates. 

The results of the deletion construction and 
their effect on horn gene expression show that 
deletion of sequences upstream of horn, up to the 
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derivatives which, complement the threonine/ 
methionine auxotrophy and those that fail to comple- 
ment, between 63 base pairs (pWFSdelta23 04) and 56 
base pairs (pWFSdelta2431) upstream of the predicted 
tiom translation start sites. The observation that 
deletions extending to locations beyond the putative 
start point of transcription (88 bp upstream of the 
hom start codon) is determined by SI nuclease 
mapping,, does not necessarily result in the complete 
loss of homoserine dehydrogenase activity. This 
loss may also be due to wealc promoter activity 
adjacent to the main transcription start site. 

The mechanism of transcriptional regulation of 
the hom-thrB operon was determined to involve 
control by the single stem/loop attenuator shown in 
FJLgure 5. Specif ic- deletion of the stem/loop 
structxrre removes the methionine repression of 
hom - thrB expression. In this structure, the se- 
ctuence ATGTAG, encoding Met-Stop, forms the loop. 
The sequence TTTTGGACA, similar to the TTGGAGA that 
precedes the predicted translational start site of 
the hom gene, precedes the ATG and thus represents a 
potential ribosome binding site* A possible model 
is one in which a bound ribosome can be momentarily 
stalled due to a low concentration of charged 
methionine tRNA, thus preventing stem/loop formation 
and allowing transcription to continue. At higher 
concentrations of methionine, the ribosome would 
move to the TAG Stop signal and disengage, allowing 
stem/ loop formation. 



/ 
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Sinal (PWFS2.9) or Oral (pWFS2.8) restriction site 
(218 and 124 bp prior to the predicted translational 
start site, respectively) does not drastically 
influence the expression of the horn gene. 

As predicted from the SI nuclease studies, the 
Dral-Sall hom-thrB fragment contains both the -10 
and -35 regions which are critical for promoter 
activity in E> coli , as reported by Hawley and 
McClure, Nucleic Acid Res. 11 :223 2 (1983). Further 
deletion by Bal 31 markedly reduces the expression 
of the hom gene product homoserine dehydrogenase, 
supporting the data obtained with SI nuclease • 
mapping of thrPl. Two of the deletion derivatives, 
pWFSdelta2304 and pWFSdelta2207 are able to comp-le- 
ment C. -glutamicum AS253 despite the reduction in 
horn 'expression to 2 and 10%, respectively, of that 
observed in strains containing the parental plasmid 
pFS3.6. The relative specific activity of homo- 
serine dehydrogenase observed in C. glutamicum AS253 
(horn) harboring these, two deletion derivatives is 
3.1 and 0.7 with respect to that observed in wild 
type strains. C. glutamicum auxotrophs reguiring 
threonine/methionine express approximately 2% of the 
wild type level of homoserine dehydrogenase activ- ~ 
ity. The deletion of the hom promoter carried in 
pWFSdelta2431 results in a 96-fold decrease in the 
expression of the cloned thr B gene thus demonstrat- 
ing a common promoter. 

The Bal31 generated deletions enable the 
mapping of the boundary between those deletion 
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LacZ protein fusions can be used to evaluate 
* promoter and operator functions directly in C. 

qlutamicum . This method is demonstrated using the 
hom-thrB promoter/operator, isolated on a 33 0 bP 
Smal-Hindlll fragment purified and ligated into 
similarly digested pSKSlOT that contains a 
promoterless lactose operon* The construction 
creates a hom - lac Z protein fusion containing the 
N-terminal eight hom amino acid residues preceding 
the lac Z gene product beta— galactosidase • The 
expression of the fusion protein products required 
the insertion of a ribosome binding site, initiating 
codon (ATG/GTG) , under the control of the hom - thr B 
promoter/ operator • The recombinant vector was 
introduced into E. coli JM8 3 where beta- 
galactosidase activity as' observed in crude ex- 
tracts. Supplementation of the growth medium with 
L-methionine represses the expression of lac 2 
two-fold. 

Deletion of a portion of the dyad symmetry 
element required for operator function demonstrates 
the role of the dyad symmetry element in the regula- 
tion of the hom - thr B gene. A 1.48 kb Kpn l 
restriction fragment containing the hom gene of the 
Bal31 deletion derivative, pWFSdelta2207 was 
purified and used to replace its counterpart in the 
parental vector pFS3.6A. The resulting recombinant 
plasmid/ designated pWFS2207deltal, contains a 
n specific 10 base pair deletion removing the left 

half of the dyad symmetry element. Identical levels 
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of homoserine dehydrogenase activity were measured 
in strains grown in MCG medium with and without 
methionine supplementation. This demonstrates that 
the dyad symmetry element is the site of 
L,-methionine repression (thrO) . 

Jdentif ication of the thrC promoter hv deletion 
analys is, complementation of a\ixotrophs and 
overproduction of the enzyme . 

The promoter sequence for expression of thr C 
was also determined by deletion analysis, 
auxotrophic complementation, and overproduction of 
enzyme. 

The overproduction of the product of the thrC 
gene, threonine synthetase, can be measured from 
crude extracts of C. glutamicum strains AS213 and 
wild type AS019 containing the parental vector 
pHY416 or the thr C containing pFSSO. The two 
strains containing the plasmids are grown in MCG 
medixim, the cells harvested, lysed, cell debris 
removed by centrifugation, the protein purified by 
40 to 60% ammonium sulphate fractionation, DEAE- 
Sephadex column chromatography with a 0.2- M to 0.2M 
to 0.6 M KCl gradient, and anion exchange 
cnromatography in a FPLC column dluted with a 0 . 1 M 
to 0.7 M KCl gradient. The results demonstrate that 
tbe protein is produced at a level 200% of that 
observed in the wild type. The method produces 
threonine synthetase specific activity demonstrating 
a purification of over 350-fold. The protein has a 
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molecular weight of 56,000 by SDS polyacrylamide gel 
electropheresis • 

Analysis of activity complementation of the 
thrC auxotroph indicates that the promoter for the 
thrC gene sequence precedes the predicted transla- 
tion start site for the thr C gene product by 
approximately 8 0 base pairs. The sequence, TTGAAA — 
(16 bp) — TAGGGT, is closely related to the E, coli 
consensus sequence as well as the promoter sequence 
determined for C, glutamicum thr Pl , AAAGCA — ISbp — 
TATAGT, Confirmation of the identification of the 
sequences the thr C gene promoter sequence is done by 
SI nuclease analysis. 

Modification of the enzyme structure and expression 
of horn, thrB and thrC . 

Once the horn, thr B and thr C genes are identi- 
fied, including the identification if the initiation 
sites of both mRNA and protein synthesis for the 
genes, it is possible to increase the quantity of 
gene expression by increasing the gene dosage by 
localization of specific genes on a multicopy 
plasmid, by site-directed mutagenesis or replacement 
of the promoter, by increasing translational effi- 
ciency through alteration of the ribosome binding 
site, or by increasing stability of the protein by 
site-directed mutagenesis. The quality of the 
particular gene can be increased using in vitro and 
site-directed mutagenesis to alter substrate 
utilization as well as the kinetic and regulatory 
properties of the enzyme. Physical properties such 
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as heat stability can also be modified. The 
construction of a vector with the three genes 
transcribed as a single iinit under the control of a 
high efficiency promoter results in more efficient 
threonine synthesis. one can also remove the 
L-threonine feedback inhibition of homoserine 
dehydrogenase or the threonine and lysine inhibition 
of aspartokinase to produce the overproduction of 
threonine. The feedback inhibition of homoserine 
dehydrogenase can be removed by in vitro mutagenesis 
using either hydroxylamine and/or sodiiam bisulfite, 
methods well known to those skilled in the art, or' 
by recombinant techniques. The mutagenized plasmids 
are reintroduced into glutamicum and screened for 
AHV resistance or by enzyme assays. Increased 
promoter efficiency can also be accomplished by site 
directed mutagenesis of the existing promoter or by 
replacement with a high efficiency promoter. 

The thrC gene can be placed \inder the trans- 
criptional control of. high efficiency promoters such 
as the E. coli promoter tac to produce elevated 
levels of the gene product. The expression vector 
PKK23 3 -2, obtained from Pharmacia Fine Chemicals, is 
restricted with Ncol-Hindlll. The plasmid PFS80 is 
cleaved with Bell, blunt ended with Klenow 
polymerase, and Ncol linkers ligated onto the flush 
ends. The ligation product is double digested with 
Ncol and Hindlll, a 2 . 8 kb fragment purified and 
ligated into similarly digested pKK233-2. The 
resulting recombinant vector designated pKC14 is 
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transformed into E. coli JM105, The requirement for 
IPTG (isopropyl-beta-D-thio-galactopyranoside) 
induction demonstrates that the thr C gene is under 
the transcriptional control of the tac promoter. 
The threonine synthetase activity measured in the 
absence of IPTG was 1,3 nmole/min/mg protein. The 
addition of 2 mM IPTG induces 24 times the threonine 
synthetase (30.9 nmole/min/mg-protein) . 

The homoserine dehydrogenase and homoserine 
kinase polypeptides encoded by the open reading 
frames corresponding to the horn and thr B gene 
products can be expressed and purified for analysis. 
The enzymes are purified from 10 liters of MCG broth 
from a CHEMAP fermentator innoculated with a 300 ml 
overnight culture of C> glutamicxim AS019/pFS3.6 
grown for 24 hours at 3 0°C with 470 rpm agitation. 
Cells were harvested by ultrafiltration and centri- 
fugation, the cell pellet resuspended in lysis 
buffer (100 mM KPO^, pH 7.0, 0.5 M KCl) and the 
cells lysed by repeated passage through a French 
pressure cell. Debris is removed by centrif ugation, 
the supernatant precipitated with ammonium sulphate, 
and the enzyme activities separated on a DEAE- 
Sephadex A-50 column eluted with a linear 0.3 M to 
0,8 M KCl gradient. Fractions containing the 
appropriate enzyme activity are pooled and the 
proteins analyzed by SDS polyacrylamide electro- 
phoresis. The proteins are then further purified on 
hydroxylapatite HPLC prior to final separation by 
preprative SDS-PAGE, The purified homoserine 
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dehydrogenase remained active through the procedure 
and has a final molecular weight of 47,000 daltons. 
The activity of the homoserine kinase is lost, but 
the protein had a molecular weight of 32,000. 

The observed molecular weights are in close 
agreement with the molecular weights predicted from 
the nucleotide sequences- The terminus of the 

homoserine dehydrogenase is blocked, however, the 
amino acid composition is in good agreement with the 
amino acid composition predicted from the gene 
sequence. The expression of activity by ligation of 
the Smal -Hindu 1 fragment containing the c. 
glutamicum thrPl and predicted NH^ ^terminal seven 
amino acid residues of the hom gene product indicate 
that the N-terminal sequence is correct. The first 
ten residues of the thr B gene product, homoserine 
kinase, is in complete agre^ent with the predicted 
amino acid sequence. This identifies the 
translation initiation site for the thr B gene as the 
ATG at nucleotide 23 42, confirming the predicted 
primary structure of the C. glutamicum homoserine 
kinase. The protein appears to Tindergo post- 
translational removal of the N-f ormyl-MET, a rela- 
tively common feature of procaryotic proteins - 
construction and application of a c. glutamicum 
promoter probe . 

A promoter probe for use in identifying, 
isolating, and c[uantifying the efficiency of 
promoters is based on pWSTl and designated pAL-i. 
The chloramphenicol acetyl transferase (CAT) gene is 
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used as the promoter probe gene. The cat gene is 
expressed in E. coli. , B. subtilis , and C. 
qlutamicum * Since cloning of strong promoters can 
induce plasmid instability by transcriptional 
interference at the replication origin, it may be 
necessary to clone the fd enteric bacteriophage 
major gene terminator at the 3* terminus of the test 
gene. Potential promoter sequences can then be 
screened for the acquisition of chloramphenicol 
resistance. Preliminary estimation of promoter 
efficiency can be accomplished by deteirmining the 
extent of antibiotic resistance. 

A number of promoter sources can be screened 
for their efficiency in C* qlutamicum . High effi- 
ciency promoters from other procaryotic systems are 
Jcnown, for example, E. coli trp , the hybrid Ptac , 
lambda pH, and B. subtil is Preg. Random C. qluta- 
micum chromosomal DNA and/or corynephage DNA frag- 
ments can also be inserted into the pclylinker site, 
upstream in the test gene and cat activity deter- 
mined to assess promoter efficiency. DNA sequencing 
and determination of the transcriptional start sites 
are used to characterize the promoter structure. 
For example, the 266 bp Dral-Haelll fragment which 
spans the predicted thr C promoter region was 
purified and ligated into the Smal restriction site 
of the pAIj-1 polylinker. The resulting recombinant 
mixture was introduced into E. coli and 
chloramphenicol resistant transf ormance obtained. 
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The primary site of metabolic regulation of 
threonine biosynthesis in C. crlutamicum is the 
L- threonine inhibition of homoserine dehydrogenase. 
To remove the metabolic block, in vitro autogenesis 
has been used to alter the hom gene product to 
produce feedback inhibition deficient variants. The 
recombinant plasmid pWFSdelta2207 was used as a 
source of the hom gene. This plasmid expresses a 
lower level of homoserine dehydrogenase (io%) than 
PFS3.6A, eliminating potential artifacts due to the 
overproduction of the hom gene product. Plasmid DNA 
was digested with Kgnl, separated by agarose gel 
electrophoresis and the fragments isolated by 
electroelution into dialysis bags. The 1.43 kb ^pn l 
fragment containing the hom gene was purified and 
treated with hydroxy 1 amine. This is a potent 
mutagen primarily causing AT to GC and GC to AT 
transitions. The mutated hom gene was isolated and 
3 micrograms of target DNA resuspended in 280 
microliters of 1 M hydroxylamine , 0.3 M KPO^ , pH 
6.0, aliquots removed at between 10 and 300 minutes 
and the reaction stopped by ethanol precipitation. 
The mutagenized fragments were religated to the 
large, 12.7 kb, Kenl restriction fragment and 
transformed into E. coli JM83. 

Plasmid DNA from ampicillin resistant trans- 
formants was purified on CsCl gradients and trans- 
formed into the restriction deficient C. glutamicum 
AS019-E12. These transf ormants were then screened 
for resistance to alpha -aminohydroxyval eric acid 



OOCiEJ: <WO_8a09ei©A2_l_> 



wo 88/09819 



PCT/US88/a2029 



37 

(AHV) in order to select for a deregulated horn gene 
product. Homoserine dehydrogenase activity assays 
were used to confirm and demonstrate the removal of 
the L- threonine mediated feedback inhibition. 
Different mutations could be combined by recombinant 
DNA techniques to determine the extent to which they 
are cooperative. 

The present invention, nucleotide sequences 
encoding threonine biosynthetic enzymes, and methods 
and sequences for the expression and regulation of 
expression of these enzyme encoding sequences are 
<iisclosed. Modifications and variations of this 
invention will be obvious to those skilled in the 
art of genetic engineering from the foregoing 
detailed description. It is intended that these 
modifications .and variations will fall within the 
scopes of the appended claims. 
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WE CLAIM: 

1. A method for the production of threonine 
comprising: 

providing nucleotide sequences for the 
genes encoding the enzymes in the threonine 
biosynthetic pathway; and 

inserting the enzyme encoding sequences 
with selected nucleotide sequences mediating 
the expression and regulation of the enzyme 
encoding sequences into an expression vector. 

2. The method of claim 1 further comprising 
inserting said expression vector into an 
expression host. 



3. 



The method of claim 2 wherein said expression 
host is Corynebacterium .' 



4. The method of claim 1 wherein said nucleotide 
expression and regxilation sequences include a 
promoter, further comprising selecting a 
promoter having a higher efficiency than the 
promoter associated with the enzyme encoding 
chromosomal genes and selecting for greater 
efficiency. 

5- The method of claim 4 wherein said higher 

efficiency promoter is obtained by mutating the 
promoter associated with said enzyme encoding 
genes. 
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6. The method of claim 4 further comprising 
selecting said high efficiency promoter from 
the group of promoters isolated from 
Escherchia , Bacillus ^ Staphylococcus and 
Streptococcus . 

7. The method of claim l further comprising 
selecting a multicopy plasmid as the expression 
vector. 

8. The method of claim 1 wherein said expression 
and regulation sequences include a ribosome 
binding site, further comprising selecting for 
a ribosome binding site with increased 
efficiency. 

9 . The method of claim 1 further comprising 
combining said enzyme encoding nucleotide 
sequences in a single expression vector. 

10- The method of claim 1 further comprising 
mutating said enzyme encoding nucleotide 
sequences and selecting for temperature 
stability. 

11. The method of claim 1 further comprising 
mutating said enzyme encoding nucleotide 
sequences and selecting for substrate util- 
ization . 
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12. 



The method of claim 1 wherein said expression 
and regulation signals include a repressor, 
further comprising modifying said repressor. 

13. The method of claim 12 wherein said repressor 
is deleted. 

14. The method of claim 12 wherein said repressor 
is mutated, 

15. The method of claim 12 wherein said repressor 
is replaced with a repressor other than the 
repressor associated with the chromosomal gene 
encoding said sequence. 



16. 



A method of constructing a promoter probe for 
Corynebacterium amino acid genes comprising: 
isolating nucleotide sequences encoding a 
detectable protein product involved in 
threonine biosynthesis in a Corynebacterium 
host, 

constructing deletions of the 3 • end of said 
nucleotide secjuences, 

inserting said deletions into an expression 
vector, 

transforming said vector into an auxotrophic 
Corynebacterium host, and 

determining if said deletions produce protein 
in auxotrophic Corynebacterium hosts. 
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17. The method of claim 16 further comprising 
sequencing the nucleotide sequences comple- 
menting the auxotrophic Corynebacterium hosts . 

18. A rDNA sequence comprising horn, thr B^ and thr C . 

19. The rDNA sequence of claim 18 further 
comprising a promoter sequence, 

20- The rDNA sequence of claim 19 further 
comprising a repressor sequence* 

21. The rDNA sequence of claim 19 wherein said 
promoter sequence is selected from sequences 
having a higher translational efficiency than 
the sequences associated with the chromosomal 
DNA. 

22. A Corynebacterium rDNA promoter sequence 
comprising a ribosome binding site TTGGAGA. 

23. A nucleotide sequence hybridizing to a rDNA 
sequence encoding homoserine dehydrogenase in 
Coirynebacter ia . 

24. A nucleotide sequence hybridizing to a rDNA 
sequence encoding homoserine kinase in Coryne- 
bacteria. 
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I 

A nucleotide sequence hybridizing to a rDNA 
sequence encoding tlrreonine synthase. 

A Coryneba cteriim rDNA translation termination 
sequence comprising 

AAGGAAGGCCCCTTCGAATCAAGA 
AGGGGCCTT. 

^ Corvneb acterium rDNA translation termination 
sequence comprising 

G,ATGGAACCAGGCCTTTCGCATTG 
AGTGGCGTTTTAAGGCCTCCA. 

^ Corvneb acterium rDNA sequence repressing 
translation in the presence of excess 
methionine comprising 

TTTGTTTTGGACAC A T G T T C TAG G 

met stop 

G T G G C C G A A A C A A A. 
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FIGURE 3 



1 GTCGACCGCGTGAAGTCGCCCTTTAGGAGAATTCTGACTAACTGGAGCCAAAACTTGATC 60 

61 CACTCGAGAGCTGTGCAGTCTCTTTTTCCTTCAATTCTGCCTGCTCGAGCTCGTAGAAGT 120 

121 AGAGGTCTACTTCAGTTGGTTCACCrrTGCACACAAGCATGAAGTAGTGGGTAGGTCGAGT 180 

181 TGTTAAATGCGGTGTAGAAGGGGAGTAGTTCGCTAGCAAAGGTTAATTTGGAGTCGCTGT 240 

241 ACTGCGGGTTCTCGGGTGGAGTATTCCCGGAGGATTCAAGAAATCTTGACGCATCTTTGA 300 

301 TGAGGTATGTTTGGAATTCGTCGGCACCTTCCTCGCCGGAGAGGTAGTAGGAGTTGTCGT 360 

36 1 AATTTGGAACCCAGATGGCAAATCGTGCGTTTTCGATTGCGTCCAGGACTTCCTCTACGT 420 

421 TGTATCTCGCACTTGTTGCAGCGGAAGCGACTCGGTTGCCGATGTCTCCGTATGCAGTGA 480 

48 1 GCGTGGCGTTTCCGAGGGGAACTTGATCAGAGGAATACACCATGGAGCCGATGTCAGAGG 540 

541 CGACTGCGGGCAGATCCTTTTGAAGCTGTTTCACAATTTCTTTGCCCAGCT * 600 

601 TCTGGAACCACTTTTGCATGCGATCGTCGTCAGAGTGGTTCATGTGAAAAATACACTGAC 660 

66 1 CATCTCAATGGTCATGGTGAAGGCCTGTACTGGCTGCGACAGCATGGAACTCAGTGCAAT 720 

721 GGCTGTAAGGCCTGCACCAACAATGATTGAGCGAAGCTCCAAAATGTCCTCCCCGGGTTG 780 
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FIGURE 3 (CONT'D) 



781 ATATTAGATTTCATAAATATACTAAAAATCTTGAGAGTTTTTC CGTTGAAAACTAAAAAG 840 



84 1 CTGGGAAGGTGAATCGAArTTCGGGGCTTTAAAGCAAAAATGAACAGCTTGGTCTATAGT 900 



901 GGCTAGGTACCCTTTTTGTTTTGCACACATGTAGGGTGGCCGAAACAAAGTAA^^ 961 



MetThrSerAlaSerAlaProSerPhe 
961 ACAACGCTCGACCGCGATTATTTTTGGAGAATCATGACCTCAGCATCTGCCCCAAGCTTT 1020 

Translation Initiation Codon 

AsnProGlyLysGlyProGiySerAlaValGlylleAlaLexiLeuGlyPheGlyThrVal 
1021 AACCCCGGCAAGGGTCCCGGCTCAGCAGTCGGAATTGCCCTTTTAGGATTCGGAACAGTC 1080 



GlyThrGluValMetArgl^uMetThrGluTyrGlyAspGluLeiiAlaHisArglleGly 
1081 GGCACTGAGGTGATGCGTCTGATGACCGAGTACGGTGATGAACTTGCGCACCGCATTGGT 1140 



GlyProLeuGluValArgGlylleAlaVaiSerAspIleSerLysProArgGluGlyVai 
1141 GGCCCACTGGAGGTTCGTGGCATTGCTGTTTCTGATATCTCAAAGCCACGTGAAGGCGTT 1200 



AlaProGluI^iiI^uThrGluAspAlaPheAlal^iiIleGluArgGluAspValAspIle 
1201 GCACCTGAGCTGCTCACTGAGGACGCTTTTGCACTCATCGAGCGCGAGGATGTTGACATC 1260 



ValValGluVallleGlyGlylleGluTyrProArgGluValValLeuAlaAlaLeuLys 
1261 GTCGTTGAGGTTATCGGCGGCATTGAGTACCCACGTGAGGTAGTTCTCGCAGCTCTGAAG 1320 



AlaGlyLysSerValValThrAlaAsnLysAlaLeuValAlaAlaHisSerAlaGluLeu 
1321 GCCGGC^GTCTGTTGTTACCGCCAATAAGGCTCTTGTTGCAGCTCACTCTGCTGAGCT^ 1380 



AlaAspAlaAlaGluAlaAiaAsnValAspLeuTyrPheGluAlaAlaValAlaGlyAla 
1381 GCTGATGCAGCGGAAGCCGCAAACGTTGACCTGTACTTCGAGGCTGCTGTTGCAGGCGCA 1440 
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FIGURE 3 (CONT'D) 



IleProValValGlyProLeuArgArgSerLeuAlaGlyAspGlnlleGinSerValMet 
1441 ATTCCAGTGGTTGGCCCACTGCGTCGCTCCCTGGCTGGCGATCAGATCCAGTCTGTGATG 1500 

, ^iy^^^^a^^^GlyThrThrAsnPhelleLeuAspAlaMetAspSerThrGlvAlaAap 
1501 GGCATCGTTAACGGCACCACCAACTTCATCTTGGACGCCATGGATTCCACCGGCGCTGAC 1560 

TTrAlaAapSerLeuAlaGluAlaThrArgLeuGljrTTrAlaGluAlaAspProThrAla 
1561 TATGCAGATTCTTTGGCTGAGGCAACTCGTTTGGGTTACGCCGAAGCTGATCCAACTGCA 1620 

AspValGluGlyHiaAspAlaAlaSerLysAlaAlalleLeuAlaSerlleAlaPleHis 
1621 GACGTCGAAGGCCATGACGCCGCATCCAAGGCTGCAATTTTGGCATCCATCGCTTTCCAC 1680 

, . o , ^^^S^^^^'^AlaAspAapValTyrCysGluGlylleSerAsnlleSerAlaAlaAsp 
1681 ACCCGTGTTACCGCGGATGATGTGTACTGCGAAGGTATCAGCAACATCAGCGCTGCCGAC 1740 

, , , , ^«<51"^^^^l»<51iiAlaGlyHi8ThrIleLysLeuLeuAlaIleCysGluLy sPhe 
1741 ATTGAGGCAGCACAGCAGGCAGGCCACACCATCAAGTTGTTGGCCATCTGTGAGAAGTTC 1800 

, , ^^^"^ysGluGlyLysSerAlalleSerAlaArgValHlsProThrLeuLeuProVal 
1801 ACCAACAAGGAAGGAAAGTCGGCTATTTCTGCTCGCGTGCACCCGACTCTATTACCTGTG 1860 

, , ^®^"^^^^°^"^^SerValAsnLysSerPheAsnAlaIlePheValGluAlaGluAla 
1861 TCCCACCCACTGGCGTCGGTAAACAA6TCCTTTAATGCAATCTTTGTTGAAGCAGAAGCA 1920 

, « . , ^^^yA^g^""etPheTyrGlyAsnGlyAlaGlyGlyAlaProThrA±aSerAlaVai 
1921 GCTGGTCGCCTGATGTTCTACGGAAACGGTGCAGGTGGCGCGCCAACCGCGTCTGCTGTC 1980 

, , ^^lyAspValValGlyAlaAlaArgAsnLyaValHlsGlyGlyArgAlaProGlyGlu 
1981 CTTGGCGACGTCGTTGGTGCCGCACGAAACAAGGTGCACGGTGGCCCTGCTCCAGGTGAG 2040 
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FIGURE 3 (CONT'D) 



SerThrTyrAlaAsnLeuProIleAlaAspPheGlyGluThrThrThrArgTyrHiaLeu 
2041 TCCACCTACGCTAACCTGCCGATCGCTGATTTCG ^TGAGACCACCACTCGTTACCACCTC 2 100 

AspMetAspValGluAspArgValGlyValLeuAlaGluLeuAlaSerLeuPheSerGlu 
2101 GACATGGATGTGGAAGATCGCGTGGGGGTTTTGGCTGAATTGGCTAGCCTGTTCTCTGAG 2160 

GlnGlylleSerLeuArgThrlleArgGlnGluGluArgAspAspAspAlaArgLeuIle 
2161 CAAGGAATCTCCCTGCGTACAATCCGACAGGAAGAGCGCGATGATGATGCACGTCTGATC 2220 

ValValThrHisSerAlaLeuGluSerAspLeuSerArgThrValGluLeuLeuLysAla 
2221 GTGGTCACCCACTCTGCGCTGGAATCTGATCTTTCCCGCACCGTTGAACTGCTGAAGGCT 2280 

LysProValValLysAlalleAsnSerVallleArgLeuGluArgAsp 
2281 AAGCCTGTTGTTAAGGCAATCAACAGTGTGATCCGCCTCGAAAGGGACTAATTTTACTGA 2340 

Stop 

Predicated start of thrB translation 

MetAlalleGluLeuAsnValGlyArgLysValThrVaiThrValProGlySerSerAl 
2341 CATGGCAATTGAACTGAACGTCGGTCGTAAGGTTACCGTCACGGTAGCTGGATCrTCT^ 2400 
Translation Initiation Codon 

aAsnl^uGlyProGlyPheAspThrLeuGlyLeuAlaLeuSerValTyrAspThrValGl 
2401 AAACCTCGGACCTGGCTTTGACACTTTAGGTTtGGCACTGTCGGTATACGACACTGTCGA 2460 

uValGluIlelleProSerGlyLeuGluValGluValPheGlyGluGlyGlnGlyGluVa 
2461 AGTGGAAATTATTCCATCTGGCTTGGAAGTGGAAGTTTTTGGCGAAGGCCAAGGCGAAGT 2520 

LProLeuAspGlySerHlsLeuValValLysAlalleArgAlaGlyLeuLysAlaAlaAs 
2521 CCCTCTTGATGGCTCCCACCTGGTGGTTAAAGCTATTCGTGCTGGCCTGAAGGCAGCTGA 2580 

pAlaGluValProGlyLeuArgValValCysHisAsnAsnlleProGlnSerArgGlyLe 
2581 CGCTGA^GTTCCTGGATTGCGAGTGGTGTGCCACAACAACATTCCGCAGTCTCGTGGTCT 2640 
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FIGURE 3 (CONT'D) 



UGlySerSerAlaAlaAlaAlaValAlaGlyValAlaAlaAlaAsnGlyLeuAlaAspPh 
2641 TGGCTCCTCTGCTGCAGCGGCGGTTGCTGGTGTTGCTGCAGCTAATGGTTTGGCGGATTT 2700 



EProLeuThrGlnGluGlalleValGlnLeuSerSerAlaPheGluGlyHisProAspAs 
2701 CCCGCTGACTCAAGAGCAGATTGTTCAGTTGTCCTCTGCCTTTGAAGGCCACCCAGATAA 2760 



nALaMaAlaSerValLeuGlyGlyAlaValValSerTrpThrAsnLeuSerlleAspGl 
2761 TGCTGCGGCTTCTGTGCTGGGTGGAGCAGTGGTGTCGTGGACAAATCTGTCTATCGACGG 2820 



yLysSerGlnProGlnTyrAlaAlaValProLeuGluValGlnAspAsnlleArgAlaTh 
2821 CAAGAGCCAGCCACAGTATGCTGCTGTACCACTTGAGGTGCAGGACAATATTCGTGCGAC 2880 



rAlaLeuVaiProAsnPheHlsAlaSerXhrGluAlaValArgArgValLeuProThrGl 
2881 TGCGCTGGTTCCTAATTTCCACGCATCCACCGAAGCTGTGCGCCGAGTCCTTCCCACTGA 2940 



uValThrHisIleAspAlaArgPheAsnValSerArgValAlaValMetlleValAlaLe 
2941 AGTCACTCACATCGATGCGCGATTTAACGTGTCCCGCGTTGCAGTGATGATCGTTGCGTT 3000 



uGlnGlnArgProAspLeuLeuTrpGluGlyThrArgAspArgLeuHlsGlnProTyrAr 
300 1 GCAGCAGCGTCCTGATTTGCTGTGGGAGGGTACTCGTGACCGTCTGCACCAGCCTTATCG 3060 



gALaGluValLeuProIleThrSerGluTrpValAsnArgLexiArgAsnArgGlyTyrAL 
3061 TGCAGAAGTGTTGCCTATTACCTCTGAGTGGGTAAACCGCCTGCGCAACCGTGGCTACGC 3120 



aALaTyrLeuSerGlyAlaGlyProThrAlaMetValLeuSerThrGluProIleProAs 
31 2 1 GGCATACCTTTCCGGTGCCGGCCCAACCGCCATGGTGCTGTCCACTGAGCCAATTCCAGA 3180 



pLysValLeuGluAspAlaArgGluSerGlylleLysValLeuGluLeuGluValAlaGl 
3181 CAAGGTTTTGGAAGATGCTCGTGAGTCTGGCArrAAGGTGCTTGAGCTTGAGGTTGCGGC 3240 
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FIGURE 3 (CONT'D) 

YProValLysValGluValAsnGlnPro 
3241 ACCAGTCAAGGTTGAAGTTAACCAACCTTAGGCCCAACAAGGAAGGCCCCTTCGAATC^ 3300 

Stop Computer predicted 

3301 GAAGGGGCCTTATTAGTGCAGCAATTATTCGCTGAACACGTGAACCTTACAGGTGCCCGG 3360 
translation Intermi, nation point 

3361 CGCGTTGAGTGGTTTGAGTTCCAGCTGGATGCGGTTGTTTTCACCGAGGCTTTCTTGGAT 3420 

342 1 GAATCCGGCGTGGATGGCGCAGACGAAGGCTGATGGGCGTTTGTCGTTGACCACAAATGG 3480 

348 1 GCAGCTGTGTAGAGCGAGGGAGTTTGCTTCTTCGGTTTCGGTGGGGTCAAAGCCCATTTC 3540 

354 1 GCGGAGGCGGTTAATGAGCGGGGAGAGGGCTTCGTCGAGTTCTTCGGCTTCGGCGTGGTT 3600 

3601 AATGCCCATGACGTGTGCCCACTGGGTTCCGATGGAAAGTGCTTTGGCGCGGAGGTCGGG 3660 

3661 GTTGTTGCATTGCGTCATCGTCGAC 3685 
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FIGURE 4 

1 TGACCGAGAGTTTTTTTGAGCAACTGGATCATTAGATAATTGOT^ 60 

61 ATCACCCGTTATGGAGACCTACTGGAATTGAGCCCAGAAACCGTCGATGTGTGCCTCAAC 120 

121 GTAGGGGTAAAGCCACGGCCCGAGCAGCACCAGCCCGACCGCGAGGACCGAACAACCAAT 180 

181 GAGAACATACAGGTTCCACTTGGACACCGGCGCTGGATTAAGGATTTCAACTGCGGTGAG 240 

241 ATTCTTCTTGTTGTTGTCCTCGAGTTTCGAGAAGCTGGGGTAATCGGGAGCTGTCATCTT 300 

301 TAAAGCACATCCTAAAACCGACAATTGAAAGTGATCAGCAACACTTTAGGGTATCGCGTG 360 

Predicted start of thrC translation 

ValGlyGluTyrCysValThrProT 
361 GGCGAAGTCACCTTTTTCAACATATTTGAGACGGTGTGGGGGAGTATTGTGTCACCCCTT 420 

rlbosome binding sequence 

rpIleGlyLeuTyrProTrpThrThrPheArgProArgAspAlaSerArgThrProAlaA 
421 GGATAGGGTTATATCCGTGGACTACATTTCGACCGCGTGATGCCAGCCGTACCCCTGCCC 480 

rgPheSerAspIleLeuLeuGlyGlyLeuAlaProAspGlyGlyLeuTyrLeuProAlaT 
481 GCTTCAGTGATATTTTGCTGGGCGGTCTAGCACCAGACGGCGGCCTGTACCTGCCTGCAA 540 
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FIGURE 4 (CONT'D) 



hrTyrProGlnLeuAspAspAlaGlnLeuSerLysTrpAxgGluValLeiiAlaAsnGluG 
54 1 CCTACCCTCAACTAGATGATGCCCAGCTGAGTAAATGGCGTGAGGTATTAGCCAACGAAG 600 

lyTyrAlaAlaLeuAlaArgGluVallleSerLeuPheValAspAspIleProValGluA 
601 GATACGCAGCTTTGGCTCGTGAAGTTATCTCCCTGTTTGTTGATGACATCCCAGTAGAAG 660 

spIleLysAlalleThrAlaArgAlaTyrThrTyrProLysPheAsnSerGluAspIleV 
66 1 ACATCAAGGCGATCACCGCACGCGCCTACACCTACC CGAAGTTCAACAGCGAAGACATCG 720 

alProValThrGluLeuGluAspAsnlleXyrLeuGlyHisLeuSerGluGlyProThrA 
72 1 TTCCTGTCACCGAACTCGAGGACAACATTTACCTGGGCCACCTTTCCGAACCCGCAACCG 780 

laAlaPheLysAspMetAlaMetGlnLeuLeuGlyGluLeuPheGluTyrGluLeuArgA 
78 1 CTGCATTCAAAGACATGGCCATGCAGCTGCTCGGCGAACTTTTCGAATACGAGCrrCGCC 840 

rgArgAsnGluThrlleAsnllel^uGlyAlaThrSerGlyAspThrGlySerSerAlaG 
84 1 GCCGCAACGAAACCATCAACATCCTGGGCGCTACCTCTGGCGATACCGGCTCCTCTGCGG 900 

luTyrAlaMetArgGlyArgGluGlylleArgValPheMetLeuThrProAlaGlyArgM 
901 AATACGCCATGCGCGGCCGCGAGGGAATCCGCGTATTCATGCTGACCCCAGCTGGCCGCA 960 

ctThrProPheGlnGlnAlaGlnMetPheGlyLeuAspAspProAsnllePheAsnlleA 
96 1 TGACCCCATTCCAGCAAGCACAGATGTTTGGCCTTGACGATCCAAACATCTTCAACATCG 1020 

laLeuAspGlyValPheAspAspCysGltiAspValValLysAlaValSerAlaAspAlaG 
102 1 CCCTCGACGGCGTTTTCGACGATTGCCAAGACGTAGTCAAGGCTGTCTCCGCCGACGCAG 1080 

luPheLysLysAspAsnArglleGlyAlaValAsnSerlleAsnTrpAlaArgLeuMetA 
1081 AATTCAAAAAAGACAACCGCATCGGTGCCGTGAACTCCATCAACTGGGCACGCCTTATGG 1140 
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laGlnValValTyrTyrValSerSerTrpIleArgThrThxThrSerAsnAspGlnLysV 
1141 CACAGGTTGTGTACTACGTTTCCTCATGGATCCGCACCACAACCAGCAATGACCAAAAGG 1230 



alSerPheSerValProThrGlyAsnPheGlyAspIleCysAlaGlyHisIleAlaArgG 
120 1 TCAGCTTCTCCGTACCAACCGGCAACTTCGGTGACATTTGCGCAGGCCACATCGCCCGCC 1260 



InMetGlyLeuProIleAspArgLeuIleValAlaThrAsnGluAsnAspValLeuAspG 
1 26 1 AAATGGGACTTCCCATCGATCGCCTCATCGTGGCCACCAACGAAAACGATGTGCTCGACG 1320 



luPhePheArgThrGlyAspTyrArgValArgSerSerAlaAspThrHisGluThrSerS 
1321 AGTTCTTCCGTACCGGCGACTACCGAGTCCGCAGCTCCGCAGACACCCACGAGACCTCCT 1380 



erProSerMetAspIleSerArgAlaSerAsnPheGluArgPhellePheAspLeuLeuG 
1381 CACCTTCGATGGATATCTCCCGCGCCTCCAACTTCGAGCGTTTCATCTTCGACCTGCTCG 1440 



lyArgAspAlaThrArgValAsnAspLeuPheGlyThrGlnValArgGlnGlyGlyPheS 
1441 GCCGCGACGCCACCCGCGTCAACGATCTATTTGGTACCCAGGTTCGCCAAGGCGGATTCT 1500 



erLeuAlaAspAspAlaAsnPheGluLysAlaAlaAlaGluTyrGlyPheAlaSerGlyA 
1501 CACTGGCTGATGACGCCAACTTTGAGAAGGCTGCAGCAGAATACGGTTTCGCCTCCGGAC 1560 



rgSerThrHlsAlaAspArgValAlaThrlleAlaAspValHisSerArgLetiAspValL 
1561 GATCCACCCATGCTGACCGTGTGGCAACCATCGCTGACGTGCATTCCCGCCTCGACGTAC 1620 



euIleAspProHisThrAlaAspGlyValHlsValAlaArgGlnTrpArgAspGluValA 
1621 TAATCGATCCCCACACCGCCGACGGCGTTCACGTGGCACGCCAGTGGAGGGACGAGGTCA 1680 



snThrProIlelleValLeuGluThrAlaLeuProValLysPheAlaAspThrlleValG 
1681 ACACCCCAATCATCGTCCTAGAAACTGCACTCCCAGTGAAATTTGCCGACACCATCGTCG 1740 
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FIGURE 4 (CONT'D) 

luAlalleGlyGluAlaProGlnThrProGluArgPheAlaAlalleMetAspAlaProP 
1741 AAGCAATTGGTGAAGCACCTCAAACTCCAGAGCGTTTCGCCGCGATCATGGATGCTCCAT ISOO 

heLysValSerAspLeuProAsnAspThrAspAlaValLysGlnTyrlleValAspAlal 
1 80 1 TCAAGGTTTCCGACCTACCAAACGACACCGATGCAGTTAAGCAGTACATAGTCGATGCGA 1 860 

leAlaAsnThrSerValLys 
1861 TTGCAAACACTTCCGTGAAGTAACTTOCTTTACGCCAAGGCCTG 1920 

1921 GATGGAACCAGGCCTTTCGCATTGAGTGGCGTTTTAAGGCCTCCAArrCTTAG^^ 1980 
Computer predicted te rm i n ator structure, *+term±nation point 

1981 GTTTGACATGGAGGGGTCACAGTCAAGCCGTTAGAAGCGATTCTGGGAGGGCAAGTTTTT 2040 

2041 CGGAGTTGGAGGTCGAATTTCCGCTGAACTGATGGGAACCAGACAGGCGTGACAAGATTG 2100 

2101 GCTAAAAACCTGAAGTTTTGTCACGCCTGTCTGGTTTCCCTCTTGTCGGTGCGAGCGAGT 2160 

2161 CCCTTGAACGACACAGATCGCGCCAAATGGAAGTGTCTGCGACCCCAGAATATTTGATTC 2220 

2221 CCCGGTCCGAGTCGTGCGAAAAATGCTCTGGTTAGTCCTCGATCATCGCAATCGCATCAA 2280 

2281 TTTCCACAGTTGCACCATAAGGAAGCGATGATGCACCCACGAAAGAGCGTGCCGGGCGGC 2340 

234 1 CTTCGAGGAAATGCTCTCGGAATTGCTCGTTGCATTCTTCGCGCAGGCTGATGTCGGTGA 2400 

2401 CAAAGTAAGTGAGTTTCACAACGTCTTTGAGTTCACCACCAGCGGTCGGAGGCGTTCACG 2460 

2461 CATGCGTTCAAGTGCTGCATCAACTGCTTCTTTACGACCGACGACTGGTTGGTAGTCCTT 2520 
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FIGURE 4 (CONT'D) 



252 1 GTCTACTGAAAGAGCGCCGGAGACGAAGATGAAATTTCCGACGCGTTTTGCGGGGGACTT 2580 

2581 ATGGGTGATCATTCGACATGTGGCCAACCATAGCTGTTTCCCCGAAGAGAGTGCCGGAAC 2640 

264 1 AGGCATTTTAGAGGTGGGGGAGCACTTCTTCGTAAATCTGGGTCAGTACTTCGCTTGCTG 2700 

2701 GTCGGCGCTGGATGTTGAAGATGACGTGGTCGATGCCAAGTTCCGAAAGCGGTGGAGGTC 2760 

2761 TTCGATGAGTTCCTGGCTGCCTACCTCTACGCCAGAGTGATTTCTTTGTGGGTGTTTCCT 2820 

2821 TCGGTGAGGTTGAGCCCCATGGAGGAAATCAACAAGGGGCGGGTGCCACCACGGGCTTTG 2880 

2881 TCCCAGAGATCGAGGCGTCCGACTTGAGCTTCAGCGGGGCGGTAGTAGGTTGCCCATCCG 2940 

294 1 TCGGCGTTTCGGGCGATCCATTGCACTGTTTGTCGGGCAGAACCTACAGCGATCATGGGG 3000 

3001 ATCTGAGCTTCAGGTGGCGTGGTTGGCGCAAATTCAAGGTCGGCCCGCATCGCAGGATCC 3060 

3061 TTCGACAAAGCTGCACGCAAAATTGCCCACCCAGACTGAATATCAGCGCGTCGATTGTCT 3120 

3121 AAGCTTTTCGGAAAAATCTCGAATTC 3146 
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"35 

ACTAAAAAGCTGGGAAGGTGAATCGAATTTCGGGGCTTTAAAGCAAAAATGAACAGCTTG 
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mRNA start hyphenated dyad sjnmaetry 

"10 * * > I \ \Met*** • I 

GTCTATAGTGGCTAGGTACCCTTTTTGTTlTGGACACATGTAGGGfG^ 
CAGATATCACCGATCCATGGGAAAAACAAAACCTGTGTACATCCCACCGGCTTTGl^ 

predicted start of thr A translation 

ThrSerAlaSerAla 

AAIAGGACAACAACGCTCGACCGCGATTATTTTTGGAGAATCATGACCTCAGCATCTGCC 
TTATCCTGTTGTTGCGAGCTGGCGCTAATAAAAACCTCTTAGTACTGGAGTCGTAGACGG 
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